Overview
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 645 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 55.6 KiB |
| Average record size in memory | 88.2 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 1 |
Alerts
| Alert | Type |
|---|---|
| Bare Nuclei is highly overall correlated with Bland Chromatin and 7 other fields | High correlation |
| Bland Chromatin is highly overall correlated with Bare Nuclei and 7 other fields | High correlation |
| Class is highly overall correlated with Bare Nuclei and 8 other fields | High correlation |
| Clump Thickness is highly overall correlated with Bare Nuclei and 7 other fields | High correlation |
| Marginal Adhesion is highly overall correlated with Bare Nuclei and 7 other fields | High correlation |
| Mitoses is highly overall correlated with Class and 1 other field | High correlation |
| Normal Nucleoli is highly overall correlated with Bare Nuclei and 7 other fields | High correlation |
| Single Epithelial Cell Size is highly overall correlated with Bare Nuclei and 7 other fields | High correlation |
| Uniformity of Cell Shape is highly overall correlated with Bare Nuclei and 7 other fields | High correlation |
| Uniformity of Cell Size is highly overall correlated with Bare Nuclei and 8 other fields | High correlation |
| Sample code number has unique values | Unique |
Reproduction
| Analysis started | 2025-01-04 19:18:41.686358 |
|---|---|
| Analysis finished | 2025-01-04 19:18:51.507755 |
| Duration | 9.82 seconds |
| Software version | ydata-profiling v4.12.1 |
Variables
Sample code number
Real number (ℝ)
Unique 
| Distinct | 645 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1074418.7 |
| Minimum | 61634 |
| Maximum | 13454352 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 61634 |
|---|---|
| 5-th percentile | 411622.4 |
| Q1 | 871549 |
| median | 1171795 |
| Q3 | 1238186 |
| 95-th percentile | 1333987.4 |
| Maximum | 13454352 |
| Range | 13392718 |
| Interquartile range (IQR) | 366637 |
Descriptive statistics
| Standard deviation | 637262.66 |
|---|---|
| Coefficient of variation (CV) | 0.59312323 |
| Kurtosis | 245.40606 |
| Mean | 1074418.7 |
| Median Absolute Deviation (MAD) | 104816 |
| Skewness | 13.459679 |
| Sum | 6.9300003 × 10⁸ |
| Variance | 4.061037 × 10¹¹ |
| Monotonicity | Strictly increasing |
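Several rows in the tables above are derived from other rows, so they can be cross-checked by hand. The following is a minimal sketch of that check for Sample code number, using only values copied from the report. The variable names are illustrative and are not part of the report or of ydata-profiling.

```python
# Values copied from the Sample code number tables above.
minimum, maximum = 61634, 13454352
q1, q3 = 871549, 1238186
mean, std = 1074418.7, 637262.66

value_range = maximum - minimum  # should match the Range row
iqr = q3 - q1                    # should match the Interquartile range (IQR) row
cv = std / mean                  # should match the Coefficient of variation (CV) row

print(value_range, iqr, round(cv, 4))
```

The mean and standard deviation are rounded in the report, so the recomputed CV agrees with the reported 0.59312323 only to about five decimal places.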
| Value | Count | Frequency (%) |
| 13454352 | 1 | 0.2% |
| 61634 | 1 | 0.2% |
| 63375 | 1 | 0.2% |
| 76389 | 1 | 0.2% |
| 95719 | 1 | 0.2% |
| 1350423 | 1 | 0.2% |
| 1350319 | 1 | 0.2% |
| 1348851 | 1 | 0.2% |
| 1347943 | 1 | 0.2% |
| 1347749 | 1 | 0.2% |
| Other values (635) | 635 | 98.4% |
| Value | Count | Frequency (%) |
| 61634 | 1 | 0.2% |
| 63375 | 1 | 0.2% |
| 76389 | 1 | 0.2% |
| 95719 | 1 | 0.2% |
| 128059 | 1 | 0.2% |
| 142932 | 1 | 0.2% |
| 144888 | 1 | 0.2% |
| 145447 | 1 | 0.2% |
| 160296 | 1 | 0.2% |
| 167528 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 13454352 | 1 | 0.2% |
| 8233704 | 1 | 0.2% |
| 1371920 | 1 | 0.2% |
| 1371026 | 1 | 0.2% |
| 1369821 | 1 | 0.2% |
| 1368882 | 1 | 0.2% |
| 1368273 | 1 | 0.2% |
| 1368267 | 1 | 0.2% |
| 1365328 | 1 | 0.2% |
| 1365075 | 1 | 0.2% |
Clump Thickness
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5844961 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.8236592 |
|---|---|
| Coefficient of variation (CV) | 0.61591484 |
| Kurtosis | -0.70065714 |
| Mean | 4.5844961 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.52835969 |
| Sum | 2957 |
| Variance | 7.9730512 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 127 | 19.7% |
| 1 | 120 | 18.6% |
| 3 | 97 | 15.0% |
| 4 | 74 | 11.5% |
| 10 | 69 | 10.7% |
| 8 | 46 | 7.1% |
| 2 | 44 | 6.8% |
| 6 | 33 | 5.1% |
| 7 | 22 | 3.4% |
| 9 | 13 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 120 | 18.6% |
| 2 | 44 | 6.8% |
| 3 | 97 | 15.0% |
| 4 | 74 | 11.5% |
| 5 | 127 | 19.7% |
| 6 | 33 | 5.1% |
| 7 | 22 | 3.4% |
| 8 | 46 | 7.1% |
| 9 | 13 | 2.0% |
| 10 | 69 | 10.7% |
| Value | Count | Frequency (%) |
| 10 | 69 | 10.7% |
| 9 | 13 | 2.0% |
| 8 | 46 | 7.1% |
| 7 | 22 | 3.4% |
| 6 | 33 | 5.1% |
| 5 | 127 | 19.7% |
| 4 | 74 | 11.5% |
| 3 | 97 | 15.0% |
| 2 | 44 | 6.8% |
| 1 | 120 | 18.6% |
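Because Clump Thickness takes only ten distinct values, its summary statistics can be rebuilt from the value counts alone. The sketch below expands the counts back into individual observations and recomputes a few rows with Python's standard `statistics` module. It assumes the reported variance uses the sample (n - 1) denominator, which is what reproduces the figure in the table. The names `counts` and `values` are illustrative.

```python
import statistics

# Value -> count, copied from the Clump Thickness frequency table above.
counts = {1: 120, 2: 44, 3: 97, 4: 74, 5: 127, 6: 33, 7: 22, 8: 46, 9: 13, 10: 69}

# Expand the table back into one value per observation.
values = [v for v, c in sorted(counts.items()) for _ in range(c)]

n = len(values)                          # number of observations
total = sum(values)                      # Sum row
mean = total / n                         # Mean row
median = statistics.median(values)       # median row
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
mad = statistics.median([abs(v - median) for v in values])  # median absolute deviation

print(n, total, median, mad)
print(round(mean, 7), round(variance, 7))
```

The same approach applies to the other nine-or-ten-valued features in this report, since each of them also lists a complete frequency table.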
Uniformity of Cell Size
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2542636 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.0878194 |
|---|---|
| Coefficient of variation (CV) | 0.9488535 |
| Kurtosis | -0.089989774 |
| Mean | 3.2542636 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1548188 |
| Sum | 2099 |
| Variance | 9.5346285 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 337 | 52.2% |
| 10 | 65 | 10.1% |
| 3 | 51 | 7.9% |
| 2 | 44 | 6.8% |
| 4 | 40 | 6.2% |
| 5 | 30 | 4.7% |
| 8 | 28 | 4.3% |
| 6 | 25 | 3.9% |
| 7 | 19 | 2.9% |
| 9 | 6 | 0.9% |
| Value | Count | Frequency (%) |
| 1 | 337 | 52.2% |
| 2 | 44 | 6.8% |
| 3 | 51 | 7.9% |
| 4 | 40 | 6.2% |
| 5 | 30 | 4.7% |
| 6 | 25 | 3.9% |
| 7 | 19 | 2.9% |
| 8 | 28 | 4.3% |
| 9 | 6 | 0.9% |
| 10 | 65 | 10.1% |
| Value | Count | Frequency (%) |
| 10 | 65 | 10.1% |
| 9 | 6 | 0.9% |
| 8 | 28 | 4.3% |
| 7 | 19 | 2.9% |
| 6 | 25 | 3.9% |
| 5 | 30 | 4.7% |
| 4 | 40 | 6.2% |
| 3 | 51 | 7.9% |
| 2 | 44 | 6.8% |
| 1 | 337 | 52.2% |
Uniformity of Cell Shape
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3348837 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.005085 |
|---|---|
| Coefficient of variation (CV) | 0.90110638 |
| Kurtosis | -0.181504 |
| Mean | 3.3348837 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.0803957 |
| Sum | 2151 |
| Variance | 9.0305359 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 307 | 47.6% |
| 2 | 57 | 8.8% |
| 3 | 56 | 8.7% |
| 10 | 56 | 8.7% |
| 4 | 43 | 6.7% |
| 5 | 33 | 5.1% |
| 7 | 30 | 4.7% |
| 8 | 28 | 4.3% |
| 6 | 28 | 4.3% |
| 9 | 7 | 1.1% |
| Value | Count | Frequency (%) |
| 1 | 307 | 47.6% |
| 2 | 57 | 8.8% |
| 3 | 56 | 8.7% |
| 4 | 43 | 6.7% |
| 5 | 33 | 5.1% |
| 6 | 28 | 4.3% |
| 7 | 30 | 4.7% |
| 8 | 28 | 4.3% |
| 9 | 7 | 1.1% |
| 10 | 56 | 8.7% |
| Value | Count | Frequency (%) |
| 10 | 56 | 8.7% |
| 9 | 7 | 1.1% |
| 8 | 28 | 4.3% |
| 7 | 30 | 4.7% |
| 6 | 28 | 4.3% |
| 5 | 33 | 5.1% |
| 4 | 43 | 6.7% |
| 3 | 56 | 8.7% |
| 2 | 57 | 8.8% |
| 1 | 307 | 47.6% |
Marginal Adhesion
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9255814 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.9235677 |
|---|---|
| Coefficient of variation (CV) | 0.99931171 |
| Kurtosis | 0.66566178 |
| Mean | 2.9255814 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4252717 |
| Sum | 1887 |
| Variance | 8.5472483 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 362 | 56.1% |
| 2 | 55 | 8.5% |
| 10 | 55 | 8.5% |
| 3 | 54 | 8.4% |
| 4 | 32 | 5.0% |
| 8 | 24 | 3.7% |
| 5 | 23 | 3.6% |
| 6 | 22 | 3.4% |
| 7 | 13 | 2.0% |
| 9 | 5 | 0.8% |
| Value | Count | Frequency (%) |
| 1 | 362 | 56.1% |
| 2 | 55 | 8.5% |
| 3 | 54 | 8.4% |
| 4 | 32 | 5.0% |
| 5 | 23 | 3.6% |
| 6 | 22 | 3.4% |
| 7 | 13 | 2.0% |
| 8 | 24 | 3.7% |
| 9 | 5 | 0.8% |
| 10 | 55 | 8.5% |
| Value | Count | Frequency (%) |
| 10 | 55 | 8.5% |
| 9 | 5 | 0.8% |
| 8 | 24 | 3.7% |
| 7 | 13 | 2.0% |
| 6 | 22 | 3.4% |
| 5 | 23 | 3.6% |
| 4 | 32 | 5.0% |
| 3 | 54 | 8.4% |
| 2 | 55 | 8.5% |
| 1 | 362 | 56.1% |
Single Epithelial Cell Size
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.296124 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.2438461 |
|---|---|
| Coefficient of variation (CV) | 0.68075292 |
| Kurtosis | 1.888946 |
| Mean | 3.296124 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.6370874 |
| Sum | 2126 |
| Variance | 5.0348452 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 346 | 53.6% |
| 3 | 70 | 10.9% |
| 4 | 47 | 7.3% |
| 6 | 40 | 6.2% |
| 1 | 39 | 6.0% |
| 5 | 39 | 6.0% |
| 10 | 30 | 4.7% |
| 8 | 20 | 3.1% |
| 7 | 12 | 1.9% |
| 9 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 39 | 6.0% |
| 2 | 346 | 53.6% |
| 3 | 70 | 10.9% |
| 4 | 47 | 7.3% |
| 5 | 39 | 6.0% |
| 6 | 40 | 6.2% |
| 7 | 12 | 1.9% |
| 8 | 20 | 3.1% |
| 9 | 2 | 0.3% |
| 10 | 30 | 4.7% |
| Value | Count | Frequency (%) |
| 10 | 30 | 4.7% |
| 9 | 2 | 0.3% |
| 8 | 20 | 3.1% |
| 7 | 12 | 1.9% |
| 6 | 40 | 6.2% |
| 5 | 39 | 6.0% |
| 4 | 47 | 7.3% |
| 3 | 70 | 10.9% |
| 2 | 346 | 53.6% |
| 1 | 39 | 6.0% |
Bare Nuclei
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.655814 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.6804821 |
|---|---|
| Coefficient of variation (CV) | 1.0067476 |
| Kurtosis | -0.9303434 |
| Mean | 3.655814 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.92146991 |
| Sum | 2358 |
| Variance | 13.545948 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 369 | 57.2% |
| 10 | 129 | 20.0% |
| 5 | 30 | 4.7% |
| 3 | 28 | 4.3% |
| 2 | 28 | 4.3% |
| 8 | 22 | 3.4% |
| 4 | 18 | 2.8% |
| 9 | 9 | 1.4% |
| 7 | 8 | 1.2% |
| 6 | 4 | 0.6% |
| Value | Count | Frequency (%) |
| 1 | 369 | 57.2% |
| 2 | 28 | 4.3% |
| 3 | 28 | 4.3% |
| 4 | 18 | 2.8% |
| 5 | 30 | 4.7% |
| 6 | 4 | 0.6% |
| 7 | 8 | 1.2% |
| 8 | 22 | 3.4% |
| 9 | 9 | 1.4% |
| 10 | 129 | 20.0% |
| Value | Count | Frequency (%) |
| 10 | 129 | 20.0% |
| 9 | 9 | 1.4% |
| 8 | 22 | 3.4% |
| 7 | 8 | 1.2% |
| 6 | 4 | 0.6% |
| 5 | 30 | 4.7% |
| 4 | 18 | 2.8% |
| 3 | 28 | 4.3% |
| 2 | 28 | 4.3% |
| 1 | 369 | 57.2% |
Bland Chromatin
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5364341 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.4521488 |
|---|---|
| Coefficient of variation (CV) | 0.69339587 |
| Kurtosis | 0.024438557 |
| Mean | 3.5364341 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.0346854 |
| Sum | 2281 |
| Variance | 6.0130338 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 153 | 23.7% |
| 3 | 152 | 23.6% |
| 1 | 128 | 19.8% |
| 7 | 71 | 11.0% |
| 4 | 40 | 6.2% |
| 5 | 34 | 5.3% |
| 8 | 28 | 4.3% |
| 10 | 19 | 2.9% |
| 9 | 10 | 1.6% |
| 6 | 10 | 1.6% |
| Value | Count | Frequency (%) |
| 1 | 128 | 19.8% |
| 2 | 153 | 23.7% |
| 3 | 152 | 23.6% |
| 4 | 40 | 6.2% |
| 5 | 34 | 5.3% |
| 6 | 10 | 1.6% |
| 7 | 71 | 11.0% |
| 8 | 28 | 4.3% |
| 9 | 10 | 1.6% |
| 10 | 19 | 2.9% |
| Value | Count | Frequency (%) |
| 10 | 19 | 2.9% |
| 9 | 10 | 1.6% |
| 8 | 28 | 4.3% |
| 7 | 71 | 11.0% |
| 6 | 10 | 1.6% |
| 5 | 34 | 5.3% |
| 4 | 40 | 6.2% |
| 3 | 152 | 23.6% |
| 2 | 153 | 23.7% |
| 1 | 128 | 19.8% |
Normal Nucleoli
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9937984 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.132175 |
|---|---|
| Coefficient of variation (CV) | 1.0462211 |
| Kurtosis | 0.16658864 |
| Mean | 2.9937984 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.3202562 |
| Sum | 1931 |
| Variance | 9.8105205 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 396 | 61.4% |
| 10 | 61 | 9.5% |
| 3 | 41 | 6.4% |
| 2 | 35 | 5.4% |
| 8 | 24 | 3.7% |
| 6 | 21 | 3.3% |
| 5 | 18 | 2.8% |
| 4 | 17 | 2.6% |
| 7 | 16 | 2.5% |
| 9 | 16 | 2.5% |
| Value | Count | Frequency (%) |
| 1 | 396 | 61.4% |
| 2 | 35 | 5.4% |
| 3 | 41 | 6.4% |
| 4 | 17 | 2.6% |
| 5 | 18 | 2.8% |
| 6 | 21 | 3.3% |
| 7 | 16 | 2.5% |
| 8 | 24 | 3.7% |
| 9 | 16 | 2.5% |
| 10 | 61 | 9.5% |
| Value | Count | Frequency (%) |
| 10 | 61 | 9.5% |
| 9 | 16 | 2.5% |
| 8 | 24 | 3.7% |
| 7 | 16 | 2.5% |
| 6 | 21 | 3.3% |
| 5 | 18 | 2.8% |
| 4 | 17 | 2.6% |
| 3 | 41 | 6.4% |
| 2 | 35 | 5.4% |
| 1 | 396 | 61.4% |
Mitoses
Real number (ℝ)
High correlation 
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6325581 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 6 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.7753996 |
|---|---|
| Coefficient of variation (CV) | 1.0874955 |
| Kurtosis | 11.448962 |
| Mean | 1.6325581 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.4081605 |
| Sum | 1053 |
| Variance | 3.1520439 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 527 | 81.7% |
| 2 | 35 | 5.4% |
| 3 | 31 | 4.8% |
| 10 | 14 | 2.2% |
| 4 | 12 | 1.9% |
| 7 | 9 | 1.4% |
| 8 | 8 | 1.2% |
| 5 | 6 | 0.9% |
| 6 | 3 | 0.5% |
| Value | Count | Frequency (%) |
| 1 | 527 | 81.7% |
| 2 | 35 | 5.4% |
| 3 | 31 | 4.8% |
| 4 | 12 | 1.9% |
| 5 | 6 | 0.9% |
| 6 | 3 | 0.5% |
| 7 | 9 | 1.4% |
| 8 | 8 | 1.2% |
| 10 | 14 | 2.2% |
| Value | Count | Frequency (%) |
| 10 | 14 | 2.2% |
| 8 | 8 | 1.2% |
| 7 | 9 | 1.4% |
| 6 | 3 | 0.5% |
| 5 | 6 | 0.9% |
| 4 | 12 | 1.9% |
| 3 | 31 | 4.8% |
| 2 | 35 | 5.4% |
| 1 | 527 | 81.7% |
Class
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 7.0930233 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | benign |
|---|---|
| 2nd row | malignant |
| 3rd row | malignant |
| 4th row | malignant |
| 5th row | benign |
Common Values
| Value | Count | Frequency (%) |
| benign | 410 | 63.6% |
| malignant | 235 | 36.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1290 | 28.2% |
| g | 645 | 14.1% |
| i | 645 | 14.1% |
| a | 470 | 10.3% |
| b | 410 | 9.0% |
| e | 410 | 9.0% |
| m | 235 | 5.1% |
| l | 235 | 5.1% |
| t | 235 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4575 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1290 | 28.2% |
| g | 645 | 14.1% |
| i | 645 | 14.1% |
| a | 470 | 10.3% |
| b | 410 | 9.0% |
| e | 410 | 9.0% |
| m | 235 | 5.1% |
| l | 235 | 5.1% |
| t | 235 | 5.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4575 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1290 | 28.2% |
| g | 645 | 14.1% |
| i | 645 | 14.1% |
| a | 470 | 10.3% |
| b | 410 | 9.0% |
| e | 410 | 9.0% |
| m | 235 | 5.1% |
| l | 235 | 5.1% |
| t | 235 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4575 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 1290 | 28.2% |
| g | 645 | 14.1% |
| i | 645 | 14.1% |
| a | 470 | 10.3% |
| b | 410 | 9.0% |
| e | 410 | 9.0% |
| m | 235 | 5.1% |
| l | 235 | 5.1% |
| t | 235 | 5.1% |
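The Length and character tables for Class follow directly from the two label counts. As a rough illustration, the sketch below rebuilds the mean label length and the per-character counts from the Common Values figures. The names `class_counts` and `char_counts` are illustrative and do not come from the report.

```python
from collections import Counter

# Class label -> count, copied from the Common Values table above.
class_counts = {"benign": 410, "malignant": 235}

n = sum(class_counts.values())
total_chars = sum(len(label) * c for label, c in class_counts.items())
mean_length = total_chars / n  # should match the Mean length row

# Character frequencies, weighted by how often each label occurs.
char_counts = Counter()
for label, c in class_counts.items():
    for ch in label:
        char_counts[ch] += c

for ch, c in char_counts.most_common():
    print(ch, c, f"{100 * c / total_chars:.1f}%")
```

Since every character is a lowercase ASCII Latin letter, the per-category, per-script, and per-block tables all repeat the same counts.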
Interactions
Correlations
| | Bare Nuclei | Bland Chromatin | Class | Clump Thickness | Marginal Adhesion | Mitoses | Normal Nucleoli | Sample code number | Single Epithelial Cell Size | Uniformity of Cell Shape | Uniformity of Cell Size |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Bare Nuclei | 1.000 | 0.684 | 0.837 | 0.594 | 0.696 | 0.474 | 0.651 | -0.122 | 0.679 | 0.745 | 0.762 |
| Bland Chromatin | 0.684 | 1.000 | 0.799 | 0.541 | 0.630 | 0.383 | 0.668 | -0.095 | 0.646 | 0.702 | 0.733 |
| Class | 0.837 | 0.799 | 1.000 | 0.732 | 0.736 | 0.511 | 0.761 | 0.000 | 0.780 | 0.852 | 0.869 |
| Clump Thickness | 0.594 | 0.541 | 0.732 | 1.000 | 0.537 | 0.416 | 0.570 | -0.021 | 0.571 | 0.667 | 0.667 |
| Marginal Adhesion | 0.696 | 0.630 | 0.736 | 0.537 | 1.000 | 0.447 | 0.634 | -0.054 | 0.671 | 0.711 | 0.750 |
| Mitoses | 0.474 | 0.383 | 0.511 | 0.416 | 0.447 | 1.000 | 0.498 | -0.077 | 0.475 | 0.465 | 0.507 |
| Normal Nucleoli | 0.651 | 0.668 | 0.761 | 0.570 | 0.634 | 0.498 | 1.000 | -0.074 | 0.706 | 0.725 | 0.757 |
| Sample code number | -0.122 | -0.095 | 0.000 | -0.021 | -0.054 | -0.077 | -0.074 | 1.000 | -0.081 | -0.056 | -0.040 |
| Single Epithelial Cell Size | 0.679 | 0.646 | 0.780 | 0.571 | 0.671 | 0.475 | 0.706 | -0.081 | 1.000 | 0.755 | 0.788 |
| Uniformity of Cell Shape | 0.745 | 0.702 | 0.852 | 0.667 | 0.711 | 0.465 | 0.725 | -0.056 | 0.755 | 1.000 | 0.894 |
| Uniformity of Cell Size | 0.762 | 0.733 | 0.869 | 0.667 | 0.750 | 0.507 | 0.757 | -0.040 | 0.788 | 0.894 | 1.000 |
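The high-correlation alerts near the top of the report can be related back to this matrix. The sketch below counts, for each field, how many other fields have an absolute coefficient at or above a cut-off. The cut-off of 0.5 is an assumption: the report does not state it, but it reproduces the field counts given in the alerts. The helper `correlated_fields` is hypothetical and is not a ydata-profiling function.

```python
# Row and column order, copied from the correlation table above.
names = [
    "Bare Nuclei", "Bland Chromatin", "Class", "Clump Thickness",
    "Marginal Adhesion", "Mitoses", "Normal Nucleoli", "Sample code number",
    "Single Epithelial Cell Size", "Uniformity of Cell Shape",
    "Uniformity of Cell Size",
]

# Coefficients copied from the correlation table above.
matrix = [
    [1.000, 0.684, 0.837, 0.594, 0.696, 0.474, 0.651, -0.122, 0.679, 0.745, 0.762],
    [0.684, 1.000, 0.799, 0.541, 0.630, 0.383, 0.668, -0.095, 0.646, 0.702, 0.733],
    [0.837, 0.799, 1.000, 0.732, 0.736, 0.511, 0.761, 0.000, 0.780, 0.852, 0.869],
    [0.594, 0.541, 0.732, 1.000, 0.537, 0.416, 0.570, -0.021, 0.571, 0.667, 0.667],
    [0.696, 0.630, 0.736, 0.537, 1.000, 0.447, 0.634, -0.054, 0.671, 0.711, 0.750],
    [0.474, 0.383, 0.511, 0.416, 0.447, 1.000, 0.498, -0.077, 0.475, 0.465, 0.507],
    [0.651, 0.668, 0.761, 0.570, 0.634, 0.498, 1.000, -0.074, 0.706, 0.725, 0.757],
    [-0.122, -0.095, 0.000, -0.021, -0.054, -0.077, -0.074, 1.000, -0.081, -0.056, -0.040],
    [0.679, 0.646, 0.780, 0.571, 0.671, 0.475, 0.706, -0.081, 1.000, 0.755, 0.788],
    [0.745, 0.702, 0.852, 0.667, 0.711, 0.465, 0.725, -0.056, 0.755, 1.000, 0.894],
    [0.762, 0.733, 0.869, 0.667, 0.750, 0.507, 0.757, -0.040, 0.788, 0.894, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report


def correlated_fields(name):
    """Return the other fields whose |coefficient| with `name` meets the cut-off."""
    i = names.index(name)
    return [
        other
        for j, other in enumerate(names)
        if j != i and abs(matrix[i][j]) >= THRESHOLD
    ]


for name in names:
    fields = correlated_fields(name)
    if fields:
        print(f"{name}: {len(fields)} field(s) at or above {THRESHOLD}")
```

Under this assumption, Sample code number triggers no correlation alert, and Mitoses is linked only to Class and Uniformity of Cell Size, which matches the alert text.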
Missing values
Sample
| | Sample code number | Clump Thickness | Uniformity of Cell Size | Uniformity of Cell Shape | Marginal Adhesion | Single Epithelial Cell Size | Bare Nuclei | Bland Chromatin | Normal Nucleoli | Mitoses | Class |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 61634 | 5 | 4 | 3 | 1 | 2 | 1 | 2 | 3 | 1 | benign |
| 1 | 63375 | 9 | 1 | 2 | 6 | 4 | 10 | 7 | 7 | 2 | malignant |
| 2 | 76389 | 10 | 4 | 7 | 2 | 2 | 8 | 6 | 1 | 1 | malignant |
| 3 | 95719 | 6 | 10 | 10 | 10 | 8 | 10 | 7 | 10 | 7 | malignant |
| 4 | 128059 | 1 | 1 | 1 | 1 | 2 | 5 | 5 | 1 | 1 | benign |
| 5 | 142932 | 7 | 6 | 10 | 5 | 3 | 10 | 9 | 10 | 2 | malignant |
| 6 | 144888 | 8 | 10 | 10 | 8 | 5 | 10 | 7 | 8 | 1 | malignant |
| 7 | 145447 | 8 | 4 | 4 | 1 | 2 | 9 | 3 | 3 | 1 | malignant |
| 8 | 160296 | 5 | 8 | 8 | 10 | 5 | 10 | 8 | 10 | 3 | malignant |
| 9 | 167528 | 4 | 1 | 1 | 1 | 2 | 1 | 3 | 6 | 1 | benign |
| | Sample code number | Clump Thickness | Uniformity of Cell Size | Uniformity of Cell Shape | Marginal Adhesion | Single Epithelial Cell Size | Bare Nuclei | Bland Chromatin | Normal Nucleoli | Mitoses | Class |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 635 | 1365075 | 4 | 1 | 4 | 1 | 2 | 1 | 1 | 1 | 1 | benign |
| 636 | 1365328 | 1 | 1 | 2 | 1 | 2 | 1 | 2 | 1 | 1 | benign |
| 637 | 1368267 | 5 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | benign |
| 638 | 1368273 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | benign |
| 639 | 1368882 | 2 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | benign |
| 640 | 1369821 | 10 | 10 | 10 | 10 | 5 | 10 | 10 | 10 | 7 | malignant |
| 641 | 1371026 | 5 | 10 | 10 | 10 | 4 | 10 | 5 | 6 | 3 | malignant |
| 642 | 1371920 | 5 | 1 | 1 | 1 | 2 | 1 | 3 | 2 | 1 | benign |
| 643 | 8233704 | 4 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | benign |
| 644 | 13454352 | 1 | 1 | 3 | 1 | 2 | 1 | 2 | 1 | 1 | benign |